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1 . (Currently Amended) A process for identifying statistically-outlying data points in at 
least one dataset, comprising: 

a) receiving the at least one dataset; and 

b) with a processing arrangement and using an adaptivelv selected 
multiscale strip function, identifying the statistically-outlying data points 
present in the at least one dataset based on4he information contained in 

the at least one dataset. 

2. (Original) The process of claim 1, wherein the at least one dataset comprises data 
associated with levels of gene expression obtained under two different conditions. 

3. (Original) The process of claim 2, wherein the two different conditions reflect an 
occurrence of at least one of a physiological process, a pathophysiological process, an 
oncogenic process, a mutational process, a pharmacologically-induced process, an 
immuno-precipitation-induced process, and a developmental process. 

4. (Original) The process of claim 1, further comprising one or more of the following 
steps: 

c) storing the at least one dataset in a matrix; 

d) shifting each row of the matrix by a center of mass of the at least one 

dataset; 

e) computing a principal axis of the at least one dataset; 

f) rotating the at least one dataset so that the principal axis coincides with x- 

axis; and 

g) generating strip functions that define boundaries outside which the 
statistically-outlying data points in the at least one dataset are located. 
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5. (Currently Amended) The process of claim 4, wherein the at least one dataset 
comprises the set E = {%■ of N points in R° . wherein D is an ambient dimension. R° is 
a D-dimensional Euclidean space. E is a data set of N points in the D-dimensional 
Euclidean space, and N is a number of points in a D-dimensional Euclidean space . 

6. (Original) The process of claim 4, wherein the strip functions that define boundaries 
that identify the statistically-outlying data points present in the at least one dataset are 
generated by computing a stopping point Fq using a top-down procedure. 

7. (Original) The process of claim 6, wherein the strip functions are smoothed by 
averaging of the strips generated from more than one determination. 

8. (Currently Amended) The process of claim 6, wherein a stopping point in the 
computation of Fq is set at Q's D{Qq)) if Fq, >a^ . wherein Fq is a cumulative fraction of 
outliers at an interval Q. Q' is a second interval. D(Qn) is a generalized dyadic grid with 
respect to an interval On . Fg - is a cumulative fraction of outliers at an interval defined bv 
Q=Q'. and an^is a positive parameter provided bv a user . 

9. (Withdrawn) The process of claim 6, wherein a stopping point in the computation of 
Fq is set at Q'e D{QJ) if |e| <«o . 

10. (Withdrawn) The process of claim 6, wherein a stopping point in the computation of 

Fq is set at g'e D{Q^)) if p, > 6o . 
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11. (Withdrawn) The process of claim 6, wherein a stopping point in the computation of 

Fq is set at 0'e D(Q,)) if |e'\d>a,.|e'|. 



12. (Original) The process of claim 6, wherein the stopping point in the computation of 
Fq is applied twice. 

13. (Currently Amended) A software arrangement operable by a processing 
arrangement for identifying the statistically-outlying data points present in at least one 
dataset based on the information contained in the at least one dataset, the software 
arrangement comprising: 

a) a first set of instructions operable to configure the processing arrangement 
to receive the at least one dataset; and 

b) a second set of instructions operable to configure , using an adaptivelv 
selected multiscale strip function, the processing arrangement to identify 
the statistically-outlying data points present in the at least one dataset 
based on-the information contained in the at least one dataset. 

14. (Original) The software arrangement of claim 13, wherein the at least one dataset 
comprises data associated with levels of gene expression obtained under two different 
conditions. 

15. (Original) The software arrangement of Claim 14, wherein the two different 
conditions reflect an occurrence of at least one of a physiological process, a 
pathophysiological process, an oncogenic process, a mutational process, a 
pharmacologically-induced process, an immuno-precipitation-induced process, and a 
developmental process. 
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16. (Original) The software arrangement of claim 13, further comprising at least one of 

the instructions: 

c) a third set of instructions operable to configure the processing 
arrangement to store the at least one dataset in a matrix; 

d) a fourth set of instructions operable to configure the processing 
arrangement to shift each row of the matrix by a center of mass of the at 
least one dataset; 

e) a fifth set of instructions operable to configure the processing arrangement 
to compute a principal axis of the at least one dataset; 

f) a sixth set of instructions operable to configure the processing 
arrangement to rotate the at least one dataset so that the principal axis 
coincides with x-axis; and 

g) a seventh set of instructions operable to configure the processing 
arrangement to generate strip functions that define boundaries outside 
which the statistically-outlying data points in the at least one dataset are 
located. 

17. (Currently Amended) The software arrangement of claim 16, wherein the at least 
one dataset comprises a set E = {xi}^, of N points in R °. wherein D is an ambient 
dimension, and R° is a D-dimensional Euclidean space. E is a data set of N points in 
the D-dimensional Euclidean space, and N is a number of points in a D-dimensional 
Euclidean space . 
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18. (Original) The software arrangement of claim 16, wherein the strip functions that 
define boundaries that identify the statistically-outlying data points present in the at least 
one dataset are generated by computing a stopping point Fq using a top-down 
procedure. 

19. (Original) The software arrangement of claim 18, wherein the strip functions are 
smoothed by averaging of the strips generated from more than one determination. 

20. (Currently Amended) The software arrangement of claim 18, wherein the stopping 
point in the computation of Fq is set atg'eD(go)) if Fr.,>a. . wherein Fr> is a 
cumulative fraction of outliers at an interval Q. Q' is a second interval. D(Q n ) is a 
generalized dyadic grid with respect to an interval Q o , F g; Is a cumulative fraction of 
outliers at an interval defined by Q=Q', and an_is a positive parameter provided by a 
user . 

21. (Withdrawn) The software arrangement of claim 18, wherein the stopping point in 
the computation of Fq is set at g'e D{Qq)) if d <«o ■ 



22. (Withdrawn) The software arrangement of claim 18, wherein the stopping point in 
the computation of Fq is set at g'e DiQ^)) if p. > 5o . 

Q 

23. (Withdrawn) The software arrangement of claim 18, wherein the stopping point in 
the computation of Fq is set at g'e D{Q^)) if g'\d>a,.b' . 
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24. (Original) The software arrangement of claim 18, wherein the stopping point In the 
computation of Fq is applied twice. 

25. (Currently Amended) A storage medium which includes thereon a software 
arrangement to be executed by a processing arrangement for identifying the 
statistically-outlying data points present In the at least one dataset based on the 
information contained in the at least one dataset, the software arrangement comprising: 

a) a first set of instructions operable to configure the processing arrangement 
to receive the at least one dataset; and 

b) a second set of instructions operable to configure , using an adaptivelv 
selected multlscale strip function, t he processing arrangement to identify 
the statistically-outlying data points present in the at least one dataset 
based on-the information contained in the at least one dataset. 

26. (Original) The storage medium of claim 25, wherein the at least one dataset 
comprises data associated with levels of gene expression obtained under two different 
conditions. 

27. (Original) The storage medium of claim 26, wherein the two different conditions 
reflect the occurrence of at least one of a physiological process, a pathophysiological 
process, an oncogenic process, a mutational process, a pharmacologically-induced 
process, an immuno-precipitation-induced process, and a developmental process. 

28. (Original) The storage medium of claim 25, wherein the software arrangement 
further comprises at least one of the following instructions: 
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c) a third set of instructions operable to configure the processing 
arrangement to store the at least one dataset in a matrix; 

d) a fourth set of instructions operable to configure the processing 
arrangement to shift each row of the matrix by a center of mass of the at 

least one dataset; 

e) a fifth set of instructions operable to configure the processing arrangement 
to compute a principal axis of the at least one dataset; 

f) a sixth set of instructions operable to configure the processing 
arrangement to rotate the at least one dataset so that the principal axis 
coincides with x-axis; and 

g) a seventh set of instructions operable to configure the processing 
arrangement to generate strip functions that define boundaries outside 
which the statistically-outlying data points in the at least one dataset are 
located. 

29. (Currently Amended) The storage medium of claim 28, wherein the at least one 
dataset comprises a set E = {xi}J!, of N points in R °. wherein D is an ambient 
dimension, and R° is a D-dimensional Euclidean space. E is a data set of N points in 
the D-dimensional Euclidean space, and N is a number of points in a D-dimensional 
Euclidean space . 

30. (Currently Amended) The storage medium of claim 28, wherein the strip functions 
that define boundaries that identify the statistically-outlying data points present in the at 
least one dataset are generated by computing a stopping point Fq using a top-down 
procedure. 
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31. (Original) The storage medium of claim 30, wherein the strip functions are 
smoothed by the averaging of the strips generated from more than one determination. 

32. (Currently Amended) The storage medium of claim 30, wherein the stopping point 
in the computation of Fq is set at Q'sDiQ^y) if F^.>a. , wherein F o is a cumulative 
fraction of outliers at an interval Q. Q' is a second interval. D(Qn) is a generalized dvadic 
grid with respect to an interval Qn. F,^- is a cumulative fraction of outliers at an interval 
defined bv Q=Q'. and an is a positive parameter provided by a user . 

33. (Withdrawn) The storage medium of claim 30, wherein the stopping point in the 
computation of Fq is set at g'e £)(go)) jsj <«o ■ 

34. (Withdrawn) The storage medium of claim 30, wherein the stopping point in the 
computation of Fq is set atQ'e D(Qq)) if p, > 8o . 

Q 

35. (Withdrawn) The storage medium of claim 30, wherein the stopping point in the 
computation of Fq is set atQ'e DiQ^)) if g*\d>a,J^* . 



36. (Original) The storage medium of claim 30, wherein the stopping point in the 
computation of Fq is applied twice. 

37. (Currently Amended) A system comprising: 

a processing arrangement operably configured to: 
a) receivem§ the at least one dataset; and 
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b) using an adaptively selected multiscale strip function, identifying tlie 
statistically-outlying data points present in the at least one dataset based 
on4he infornnation contained in the at least one dataset. 

38. (Currently Amended) The system of claim 37, further comprising a further 
processing arrangement configured to generate the at least one dataset. 

39. (Original) The system of claim 38, further comprising a detector configured to 
detect a plurality of signals indicative of gene expression and convert the detected 
signals into the at least one dataset. 

40. (Withdrawn) The method of claim 1, wherein the at least one data set comprises 
data associated with financial trends. 

41. (Withdrawn) The software arrangement of claim 13, wherein the at least one data 
set comprises data associated with financial trends. 

42. (Withdrawn) The storage medium of claim 25, wherein the at least one data set 
comprises data associated with financial trends. 

43. (New) The process of claim 1, further comprising: at least one of displaying or 
storing the statistically-outlying data points in a storage arrangement in at least one of a 
user-accessible format or a user-readable fomiat. 

44. (New) The software arrangement of claim 13, further comprising: a third set of 
instructions operable to configure the processing arrangement to at least one of display 
or store the statistically-outlying data points in a storage arrangement in at least one of 
a user-accessible format or a user-readable format. 

10 



[36048/US/2 - 475396.00173] 
PATENT 



45. (New) The storage medium of claim 25, wherein the software arrangement further 
comprises: a third set of instructions operable to configure the processing arrangement 
to at least one of display or store the statistically-outlying data points in a storage 
arrangement in at least one of a user-accessible format or a user-readable format. 

46. (New) The system of claim 37, wherein the processing arrangement is further 

operably configured to at least one of display or store the statistically-outlying data 
points in a storage arrangement in at least one of a user-accessible format or a user- 
readable format. 
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